08. What Should You Check?

Check all of the things you should check about your dataset when getting started.

SOLUTION:
  • Check each column is a variable, and each row is an individual.
  • Look at the number of rows and columns in the dataset.
  • Check if there are missing values in any of the rows or columns.
  • Check if any columns are of different data types than you would expect (ex: numbers showing as strings)
  • Build exploratory plots like bar charts, histograms, and scatterplots to better understand the data.